AMENDMENTS TO THE SPECIFICATION:

Please replace the Abstract of the Disclosure with the 
following rewritten Abstract which appears on a separate sheet. 

Page 1, before line 4, insert the following heading: 
--BACKGROUND OF THE INVENTION--

Page 2, replace the paragraph beginning on line 4, which 
was added in the Preliminary Amendment of March 25, 2002, with the 
following amended paragraph: 

--Another method described in document US-A-5 625 767
enables [[thermatic]] thematic classification to be performed on the
basis of a statistical analysis of the document. However, that
method requires the documents to be manually [[classifed]] classified
beforehand.--

Page 2, between lines 9 and 10, insert the following
heading:

--SUMMARY OF THE INVENTION--

Page 2, replace the paragraph beginning on line 10 and 
bridging pages 2 and 3 with the following amended paragraph: 

--The invention thus provides a method of thematically 
classifying documents, in particular for making up or updating 
thematic databases for a search engine, the method being 
characterized in that it comprises the following steps: 

- selecting a sample of documents representative of 
each theme; 
- identifying within the selected documents elements
that are characteristic of each theme; 

- allocating a coefficient to each identified element, 
which coefficient is representative of the relevance of said 
element relative to the corresponding theme; 

- downloading documents from a computer network; 

- for each downloaded document to be classified,
identifying said theme-characterizing elements that are contained
in the document for each of the themes, and for each theme
corresponding to the documents, using the coefficients allocated
to said elements to calculate [[the]] a characteristic value [[of a
characteristic]] representative of the relevance of that theme for
the document[[,]] in order to decide whether or not the document
relates to the theme, said theme-characterizing elements
identification and calculation steps being performed
automatically for each document downloaded from [[a]] the
computer network;

- classifying the downloaded documents as a function of 
the themes with which they deal; and 

- storing the documents classified thematically in 
databases that can be interrogated on the basis of themes 
contained in a request; 

and in that the step of allocating said coefficient to 
each identified element comprises the following steps for each 
theme:



Docket No. 0512-1024 
Appln. No. 10/088,895 

- calculating [[the]] a frequency of the element in the 
selected documents relating to the theme; 

- calculating [[the]] a frequency of the element in the
selected documents that do not relate to the theme; and 

- calculating the ratio of the calculated frequencies.--

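The coefficient allocation and classification steps recited above can be illustrated with a minimal Python sketch. It is not the applicant's implementation. Several points are assumptions made only for illustration: a document is modelled as a set of elements; "frequency" is taken to be the fraction of sample documents containing the element; the characteristic value is taken to be the sum of the coefficients of the elements present; the decision uses a fixed threshold; and a small `floor` avoids division by zero when an element never occurs outside the theme. All function and parameter names are hypothetical.

```python
def element_frequency(element, documents):
    """Fraction of documents (each a set of elements) containing the element."""
    if not documents:
        return 0.0
    return sum(1 for doc in documents if element in doc) / len(documents)


def allocate_coefficients(samples, theme, floor=1e-3):
    """Return {element: ratio} for one theme.

    samples maps each theme to its list of sample documents.
    The ratio is (frequency in documents relating to the theme) divided by
    (frequency in documents not relating to the theme). The floor is an
    assumption added here to keep the ratio finite.
    """
    in_theme = samples[theme]
    out_theme = [doc for t, docs in samples.items() if t != theme for doc in docs]
    # Assumption: every element seen in the theme's sample is a candidate.
    elements = set().union(*in_theme) if in_theme else set()
    coefficients = {}
    for element in elements:
        f_in = element_frequency(element, in_theme)
        f_out = element_frequency(element, out_theme)
        coefficients[element] = f_in / max(f_out, floor)
    return coefficients


def characteristic_value(document, coefficients):
    """Assumed aggregate: sum of coefficients of the elements the document contains."""
    return sum(c for element, c in coefficients.items() if element in document)


def classify(document, coefficients_by_theme, threshold):
    """Return the themes whose characteristic value reaches the assumed threshold."""
    return [
        theme
        for theme, coefficients in coefficients_by_theme.items()
        if characteristic_value(document, coefficients) >= threshold
    ]


samples = {
    "sport": [{"goal", "match"}, {"match", "team"}],
    "finance": [{"stock", "market"}, {"market", "team"}],
}
coefficients_by_theme = {t: allocate_coefficients(samples, t) for t in samples}
themes = classify({"match", "goal"}, coefficients_by_theme, threshold=2.0)
```

In this toy sample, "team" occurs equally often inside and outside the "sport" theme, so its ratio is 1 and it carries little weight, whereas "match" occurs only in "sport" documents and receives a large ratio.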
Page 4, replace the paragraph beginning on line 23 and
bridging pages 4 and 5 with the following amended paragraph:

--The invention also provides a module for thematically
classifying documents, in particular for a search engine, the 
module being characterized in that it comprises a central 
processor unit having means for comparing elements extracted from 
each document with elements characteristic of various themes, 
each element being allocated a coefficient representative of the 
relevance of said element for a corresponding theme, and means 
for calculating [[the]] a characteristic value [[of at least one
characteristic]] representative of the relevance of a theme for the
document on the basis of the coefficients of said characteristic 
elements that the document contains, in order to decide whether 
or not the document relates to said theme, said central unit 
being connected to means for storing documents classified by 
theme that can be interrogated on the basis of themes contained 
in a request, and in that the module has means for calculating 
the frequency of the element in the selected documents relating
to the theme, means for calculating the frequency of the element 
in the selected documents that do not relate to the theme, and 
means for calculating the ratio between the calculated
frequencies.--

Page 5, between lines 17 and 18, insert the following
heading:

--BRIEF DESCRIPTION OF THE DRAWINGS--

Page 5, between lines 28 and 29, insert the following
heading:

--DESCRIPTION OF THE PREFERRED EMBODIMENTS--

Page 9, replace the paragraph beginning on line 6 with 
the following amended paragraph: 

--To perform this calculation, the classification
module begins by calculating the mean value R^^ of the ratios R
of the characteristic elements of each theme (step 36).--
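As a rough illustration of the averaging described in this paragraph, the following Python sketch computes one mean ratio per theme. The data layout (a dictionary of ratios R keyed by element, for each theme) and the function name are assumptions made for the example. The specification text quoted here does not say how the mean is used afterwards, so the sketch stops at the mean.

```python
from statistics import mean


def mean_ratio_per_theme(ratios_by_theme):
    """Return {theme: mean of the ratios R of that theme's characteristic elements}.

    Themes with no characteristic elements are skipped (an assumption,
    since the mean of an empty set is undefined).
    """
    return {
        theme: mean(ratios.values())
        for theme, ratios in ratios_by_theme.items()
        if ratios
    }


ratios_by_theme = {
    "sport": {"goal": 4.0, "match": 2.0},
    "finance": {"stock": 3.0},
}
means = mean_ratio_per_theme(ratios_by_theme)
```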

Page 9, replace the paragraph beginning on line 22 with 
the following amended paragraph: 

--After this stage of acquiring thematic vocabulary,
implemented using a corpus of documents representative of various
themes, the second stage of thematic classification proper can be
performed in order to make up thematic databases given overall
numerical reference 42 from documents 50 collected automatically
from the computer network by robots such as 44.--

Page 9, replace the paragraph beginning on line 29 with
the following amended paragraph: 

--These documents 50 are input to the thematic
classification module which also receives an indication of the
theme nomenclature 18 and the elements available from the outcome
of above-mentioned step 34. This module proceeds automatically
to calculate the themes on which a document relates (step 46).--

Page 11, replace the paragraph beginning on line 1 with 
the following amended paragraph: 

--It will be understood that this procedure for 
automatically calculating the themes of documents 50 downloaded
by the robots 44 enables the indexing module of a search engine 
to classify these documents as a function of the themes dealt 
with and to build up the thematic databases 42.-- 



